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Abstract —We study the maximum sum rate optimization prob¬ 
lem in the multiple-input multiple-output interfering broadcast 
channel. The multiple-antenna transmitters and receivers are 
assumed to have perfect channel state information. In this 
setting, finding the optimal linear transceiver design is an NP- 
hard problem. We show that a reformulation of the problem 
renders the application of generalized Benders decomposition 
suitable. The decomposition provides us with an optimization 
structure which we exploit to apply two different optimization 
approaches. While one approach is guaranteed to converge to 
a local optimum of the original problem, the other approach 
hinges on techniques which can be promising for devising a global 
optimization method. 


I. Introduction 

Finding the maximum sum rate operating point in the 
multiple-input multiple-output (MIMO) interference channel 
is an open problem. It is proven that for the case of perfect 
channel state information at the transmitters and the receivers, 
the problem is NP-hard even for the single receive antennas 
case m. Seeking the optimal solution is of paramount im¬ 
portance when evaluating low complexity distributed schemes 
such as in El, 0- 

An approach to simplify the problem is to characterize 
the set of necessary transmission strategies for Pareto opti¬ 
mal operation for each user independently 0. However, the 
parametrization in a reveals that the search space remains 
huge for finding the jointly sum rate optimal transmission 
strategy. Another approach in 0 parameterizes the necessarily 
optimal transmission strategies by Lagrangian multipliers and 
shows an uplink-downlink duality in the setting. Such char¬ 
acterization is useful for devising heuristic algorithms which 
can efficiently update the Lagrangian multiplies. 

Alternating optimization methods which reach local optima 
of the sum rate optimization problem are reported in 151, 171 
and recently in J5J for the case of hardware impairments at 
the transmitters and receivers. The mentioned approaches rely 
on optimization of the weighted minimum mean square error 
(MMSE) by successive optimization of the transmit covariance 
matrices, receive covariance matrices as well as a weighting 
matrices. Such methods can be implemented in a distributed 
fashion whenever it is possible to exchange the weighting 
matrices between the transmitters and receivers. 


A technique which helps in finding global solutions in prob¬ 
lems with special couplings between the optimization variables 
is the generalized Benders decomposition. This decomposition 
method has been applied for control problems with bilinear 
inequality constraints in 0 and generalized to bilinear matrix 
inequality constraints in ffTOll . 

We show that the maximum sum rate problem can be cast 
in a form with bilinear matrix constraints suitable for the 
generalized Benders decomposition. In doing so, we reveal a 
structured way to approach the solution of the NP-hard opti¬ 
mization problem. Consequently, we provide two optimization 
approaches. The first approach is based on the work in ED, 
lfl2l which leads to a local optimum of the original problem. 
The second approach utilizes a novel approximation on the 
upper bound of the sum rate. Although convergence to a 
stationary point of the original problem is not verified in the 
second approach, simulation results illustrate its efficiency. 

Outline: After describing the system model and formulating 
the sum rate maximization problem in Section [II] we make the 
necessary problem reformulation to cast the problem using 
generalized Benders decomposition in Section III Exploiting 
the decomposition, two approaches to deal with the original 
problem are provided in Section IV Discussion of the results 
using numerical examples are given in Section [V] before we 
draw the conclusions in Section [VTl 

Notations: Column vectors and matrices are given in low¬ 
ercase and uppercase boldface letters, respectively. ||-||, |-|, 
and (-)t denote respectively the Euclidean norm, absolute 
value, and Hermitian transpose. I is an identity matrix. Define 
the collection {a} := (ai,..., <Z|;q). CA/”(0,A) denotes a 
circularly-symmetric Gaussian complex random vector with 
covariance matrix A. 


II. System Model and Problem Formulation 
Consider a set of cells K. = {1,... ,K}. The base station 
in a cell k serves a set of users Uk- We assume that the base 
stations and the users use multiple antennas. Let rik and rriki 
be the number of antennas at base station k and user i in cell 
k, respectively. The flat fading channel matrix from a base 
station j to a user i £ Uk in cell k is H jki € . 

The received signal at a user i £ Uk is 
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Yki - (1) 

where the precoded signal x ; , £ C'" ,J has zero mean and 
covariance EjxjjX^} = Xjj, and z^.j ~ CJV(0,Ia 2 ) is 




additive white Gaussian noise. 

We assume that the maximum transmission power at a base 
station k is restricted to P k and define the set of feasible 
transmission strategies of a base station k a^] 

x k = {(x fcl ,...,x fc | Wfc |) I Xfei € c nkXn \x ki y 0, 

for all i £ U k , 'V' tr(X fci )<P fc |. ( 2 ) 

z — 'i&Jk ) 

Define the set of all feasible transmit covariance matrices 
of all base stations as 


X = X x x ■■■ x X K . ( 3 ) 


For a given X £ X and assuming linear MMSE decoding, 
the achievable rate at user i £ U k is written as 


P fei (X) = log 2 




( 4 ) 


where the interference and noise covariance matrix is 


Z/ti —a~I + H kk iX k iH\. ki 

ieu k \{i} 

S. S 

intracell interference 

+ E E H ^ x *' H }w- (5) 

jeic\{k} I’eUj 

intercell interference 

We are interested in finding the maximum sum rate operat¬ 
ing point which is a solution of the following problem: 

max E ^fci(X) s.t. Xef. ( 6 ) 

fcG/C iGZ-Yfc 

Problem <[ 6 ]> is NP-hard (T|. 

III. Problem Reformulation 


In this section, we will provide an equivalent formulation 
of the maximum sum rate problem in ([6]), which will enable 
the application of the generalized Benders decomposition. The 
reformulation is based on the following lemma. 

Lemma 1: Define 


R kl (X,Y ki ) = log 2 


J ki 




+log 2 |Y fci |. ( 7 ) 


Then, 

(i) R ki {X) > R ki (X, Y fei ) for 0 A Y fcj and Y ki A Zj)/, 
with equality if Y ki and Z^ 1 have the same eigenvalues. 

(ii) R ki (X, Y k i) is jointly concave in X and Y k i. 

Proof: The proof is provided in Appendix [A] ■ 

We can formulate the following sum rate problem using the 
lower bound on the achievable rate in | 7 ]i as 

max max E E ^«( X ’ Y ki ) (8a) 

* ‘ kGKieU k 

s.t. X£X, (8b) 

Z k ,Y ki fl, i £ U k ,k £ /C, (8c) 

Y k i A 0, i £ lA kl k £ 1 C. (8d) 


'More general linear power constraints can be also adopted similar to the 
model in 1131 Section II.A]. 


By using Lemma [I] the equivalence of the Problem [8] and the 
original problem in ([6) can be established. 

Theorem 1: Problem ([8]) and Problem |6) are equivalent. 

Proof: For any X £ X, an optimal value of Y ki is Z^ 1 
for alii £ U k , k £ K. following (i) in Lemma [I] Accordingly, 
with Y k i = Zfl for all i £U kl k £ 1 C, the objective functions 
as well as the constraint sets of Problem ([8]) and Problem (|fi]) 
are identical. ■ 

In Problem ([8]), the objective is jointly concave in the 
optimization variables according to (ii) in Lemma [I] and the 
constraints ( |8b| and ( |8d| ) are convex. The non convexity of 
Problem ([8]i is due to the bilinear matrix inequality constraints 
in ( |8c| . Problem however, has the following significant 
property: If the variables {Y} are fixed, then Problem ([8]i is 
convex in X. Also, if we fix X, then the problem is convex in 
{Y}. For non convex problems with such attributes, a method 
called Benders decomposition has been proposed in fl 4 l for 
linear programs and generalized in 021 to a wider class of 
problems utilizing nonlinear duality theory. 

A. Generalized Benders Decomposition 

The main idea of the generalized Benders decomposition 
is the projection of the problem onto the space of a specific 
set of variables m leading to inner and outer optimization 
problems. In our case, the projection of Problem <0 in the 
space of the X variables reformulates Problem ([8]) to 


max v(X) s.t. X £ X (T V, 

X 

( 9 ) 

where v(X) is defined by the inner optimization, 
the primal problem: 

also called 

t;(X)=max £ £ £ fci (X, Y fci ) 

* ‘ kGlCi&Uk 

(10a) 

s.t. ZfciYki ^ I, i £ Ltki £ ^5 

(10b) 

Yh 0, i £ M k ,k £ IC, 

(10c) 


and 


V = {X | Zfc, Y /.,j A I for some Y ki y 0, i £ U k , k £ K.} . 

( 11 ) 

The set X D V in ([ 9 ]) represents the projection of the feasible 
region of Problem onto the X variable space. This is 
generally important in order to ensure feasibility of the primal 
problem in <0®. However, in our case, the set V in 
includes the set X since it is always possible to find a feasible 
set of variables {Y} for any X £ X. Hence, the primal 
problem in ([TO]) is always feasible and we can consequently 
remove the constraint associated with V. 

As Problem <[ 9 ]> is difficult to solve, the inner maximization 
problem in ( [T()| is considered in its dual form. 

The Lagrangian of the primal problem in ([TO]) is 

£(X,{Y},{r}) = 

YE k i(X,Y ki )+ti(T ki (I-Z ki Y ki )), ( 12 ) 










where {T} are Lagrangian multipliers associated with con¬ 
straints ( |10b| i. Since strong duality holds for the inner maxi¬ 
mization problem. Problem <[9]> can be reformulated as 

(13a) 

£(X,{Y},{r}), (13b) 

r fci ^o, i£U k ,k£tC, (13c) 

Yfci h 0, t € Wki k £ 1C, (13d) 

(13e) 

By using the definition of a minimum. Problem © can be 
reformulated to what will be called the master problem: 

(14a) 

£(X,{Y},{r}), (14b) 

for all T ki Y 0,i £U k ,k £ 1C, 

Yfci ^ 0, i € U k , k £ 1C, (14c) 

(14d) 

The form of Problem ( fl4| ) is convenient for two optimization 
approaches which we address next. 

IV. Optimization Approaches 

Optimization methods which utilize the generalized Benders 
decomposition alternate between solving the primal problem 
in m an d the master problem in ( fl4| >. The solution of the 
primal problem is used as a lower bound on the maximum 
sum rate, while an upper bound is characterized by the master 
problem. However, the master problem in <H3 is hard to 
solve due to the existence of the inner optimization problem 
(fl4bl>-(fl4c]l. Two methods which deal with this problem are 
presented in the next subsections. 

A. Approach 1 

This approach is based on the work in dh, dd and 
is described using Algorithm |T| In each iteration t of the 
algorithm two problems are solved. First in Line 3, for fixed 
variables X*” 1 (obtained from iteration t — 1) the convex 
primal problem in ( fT0| > is solved to obtain {Y 1 } and a set 
of duals {r'} associated with constraints ( |10b| ). The second 
optimization in Line 4 uses the solution of the primal problem 
as fixed inputs for the master problem problem in ( p~4] ) to 
obtain X*. Notice that for fixed {Y f } and {T*}, the master 
problem problem is convex in X. 

In each iteration t of Algorithm [T] the following is satisfied: 

£(x i ,{Y t },{r f })> Y, E^(X‘.YL). (15) 

fcG/C i&Jk 

whose proof follows similar lines as the proof in DU Appendix 
B]. That is, the solution of the master problem is always larger 
than the solution of the primal problem in Algorithm |T| The 
convergence of Algorithm [I] to a local optimum is guaranteed 
according to lfl2l Theorem 2.5]. 


max v(X) 
x 

s.t. v(X) < max 
{Y} 

S.t. 

xgx. 


max v(X) 

x 

s.t. v(X) = min max 
in {y} 

s.t. 

X G X. 


Algorithm 1 Approach 1 for sum rate optimization. 

Initilize: t = 0, X° £ X. accuracy measure e 
l: repeat 

2: t = t+ 1; 

3: Solve primal problem ( fTO} for fixed X f 1 to obtain 

Y ki and T ki for all i £U k ,k £ 1C. 

4: Solve master problem ( fl4l > for fixed Y ki and T f kl to 

obtain X*. 

5: until Efce/C "Yhi&Ak (7?fci(X ) — -Rfci(X )) < £ 


Algorithm 2 Approach 2 for sum rate optimization. 

Initilize: t = 0, X° £ X, accuracy measure e 
l: repeat 

2: t = t+l; 

3: Solve primal problem ( fTO} for fixed X 1 1 to obtain 

Y ki and T f kl for all i £U k ,k £ 1C. 

4: Solve master problem ( [T4| using ( |LIN-LB| > 

lin—lb 

X* = argmax £(X, {Y}, {F*}) (16a) 

{Y},xeY {Y‘} 

s.t. Y ki y 0, i£Uk,k£lC. (16b) 
5: until E fce ^E ie ^(^( X ‘) -^(X 4 - 1 )) < e 


B. Approach 2 

Similar to Approach 1, the second approach relies on a 
solution for the master problem to obtain an upper bound on 
the sum rate. Based on the method in j9[, a linearization of 
the Lagrangian £(X, {Y}, {F f }) is applied around {Y f }, 
where, similar to Approach 1, {Y f } and {T f } are from 
the optimization of the primal problem in © at the tth 
iteration of the algorithm. The linearization of the Lagrangian 
corresponds to the first-order Taylor series expansion about 
{ Y 1 } as given in ( |LIN[ i at the top of the next page. This 
function satisfies the following (9] Property 4]: 


max£(X,{Y}, {r‘}) < max£(X, {Y}, {r*}) 


lin 


{Y*} 


(17) 


The above inequality ensures achieving an upper bound on 
the sum rate by the master problem ( [T4| with the Lagrangian 
replaced by the linearization. While in a global opti¬ 
mization framework is constructed based on the linearization 
approach, it proves hard to apply the existing techniques in our 
case where the variables are Hermitian matrices. Whenever a 
method can be found to maximize dLlN) in X and {Y}, a 
global optimization scheme can be constructed. 

In this work, we will use an approximation on the lineariza¬ 
tion of the Lagrangian in (|LIN|) with the following property. 


Proposition 1: For the expressions given in ( |LIN| > and 
(|LIN-LB| at the top of the next page, the following holds 


lin 


max £(X, {Y}, {r‘}) > max £(X, { Y}, {r*}) 

{Y} L J {Y‘} {Y} 1 J 


lin—lb 

{Y*} ' 
( 18 ) 





























(LIN) 


£(X, {Y}, {r 4 }) = £(X, {Y 4 }, {r 4 }) + ZE tr ( V v^(X, {Y}, {r 4 }) ) (Y kt - Y 4 w ) 

{y 1 } v-/ f—r. \ ye... / 


'CD 




=( Y * )-!-r‘ z fc 


lin—lb ^^ 1 

£(X,{Y},{r 4 }) =£(X,{Y 4 },{r 4 })^ E oll( Y «) - r‘wZ« - (Y fci - Y 4 fci )| 

4 4 kGlCieUk A 


(LIN-LB) 


Proof: The proof is provided in Appendix [B] ■ 

The algorithm for Approach 2 is given in Algorithm [2] 
and alternates between solving the primal problem in ( |T()| to 
obtain a lower bound on the sum rate and solving the master 
problem in ( fl6| to obtain an approximate upper bound on the 
sum rate. Note that ( |LIN-LB| ) is jointly concave in X and 
{Y} and hence Problem ( | 1 6[ i is convex. As in Algorithm [I] 
solving the primal problem in Line 3 for fixed X 4 1 gives 
{Y 4 } and the duals {IT 4 } associated with constraints ( |10b| i. 
These parameters are used for solving ( fl6| which is the master 
problem in ( |T4| ) with the Lagrangian replaced by ( |LIN-LB~| . 

Although simulation results show ensured convergence of 
Algorithm [2] a global convergence proof is hard to conduct 
mainly due to the lower bound property in Proposition |T] 
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C. Complexity 

Using the same notation as in 0, let k be the total number 
of users in the network and R and T be the number of antennas 
at each receiver and transmitter, respectively. The complexity 
in each iteration of Algorithm [T] or Algorithm [2] is dominated 
by calculating the interference and noise covariance matrices 
in 0. and solving the primal and master problems. 

As in the alternating weighted MMSE approach 0, the 
complexity of calculating Z/ ; ., in ([5J for all receivers is 
0(k 2 TR 2 ). The solution of the primal problem in ( |TT)| is 
the inverse of Z k i and thus requires for all users 0(nR 3 ) 
arithmetic operations which is similar to the complexity for 
calculating the decoders in 0. The complexity for solving the 
master problem, which is a convex semidefinite program, is 
hard to quantify due to the dependency on the structure of the 
problem and its representation in the solver. Generally how¬ 
ever, it is known that interior point methods for semidefinite 
programming are very efficient lfl5ll . Having that the calcu¬ 
lation of the precoders in 0 is a quadratically constrained 
quadratic program which can be solved by the more general 
semidefinite programs, we infer that our algorithm has higher 
complexity per iteration compared to the algorithm in 0. 

V. Illustrations 

In the simulation examples, we set the transmission power 
as P k = 1 for all k G 1C, and choose SNR = 1/a 2 = lOdB. 
The accuracy measure for the algorithms is set to e = 10 -5 . 
We use CVX lfT6l with the SeduMi solver liTTl for solving the 
optimization problems within the algorithms. 

In Fig. |Tj the achievable rate region in a setting with two 
cells and one user in each cell is illustrated. The cloud of points 
corresponds to 10 6 randomly generated achievable rate tuples. 


Fig. 1. Illustration of a two-user rate region. The number of antennas used 
at each transmitter and receiver is two. 

Here, we use the algorithm proposed in fl8l to generate the 
transmit covariance matrices with trace constraints satisfying 
0. The outer boundary of the rate region is plotted using 
Algorithm [2] where each point corresponds to a weighted 
sum rate optimization. The weighted performance optimization 
includes weighting factors for each term in the summation in 
the objective of |6) which we have excluded in our model 
due to obvious extension possibility. In the two user case in 
Kg.0 ten weighting factors for user one Ai have been chosen 
uniformly from [0,1] and the weights for users two are set 
as A 2 = 1 — Ai. It can be observed that the high number 
of randomly generated rate tuples is not sufficient to obtain 
points near to the plotted boundary. Hence, for benchmarking 
purposes, an exhaustive search on randomly generated rate 
tuples is not likely to be efficient especially when more than 
two antennas are used at the transmitters and the receivers as 
well as for larger number of users. In Fig. |T] the optimized sum 
rate point is distinguished with a square and star marker and 
obtained by all three algorithms. Note however that non of the 
algorithms is guaranteed to converge to the global optimum 
of ([6]) and currently it is not possible to verify whether the 
obtained point corresponds to the maximum sum rate. 

In Fig. [2] the convergence behavior of our algorithms is 
shown for a setting with ten cells and one user in each cell. 
It can be observed that the upper bound approximation in 
Approach 2 (Algorithm [2| obtained as the solution of © 
is not monotonically decreasing. The upper bound on the 
sum rate in Approach 1 (Algorithm |TJ) is, in comparison 
to Approach 2, much closer to its associated lower bound. 
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- Approach 1: sum rate lower bound 
Approach 2: sum rate approximate upper bound 
Approach 2: sum rate lower bound 
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max sum rate by grid search over 10 6 random samples 





Fig. 2. Convergence behavior for K = 10 cells with one user per cell. The 
number of antennas at each transmitter and receiver is two. 


Despite this fact. Approach 1 needs higher number of iterations 
of 40 according to the example in Fig. [2] The alternating 
weighted MMSE approach 0 shows fast convergence be¬ 
havior to the solution which is also obtained by the other 
algorithms. Grid search for the maximum sum rate over 10 6 
randomly generated achievable rate tuples is shown to be not 
adequate for benchmarking purposes. 

VI. Conclusions 

Sum rate optimization in multicell MIMO is reformulated 
using generalized Benders decomposition. As a result, two 
optimization approaches are applied. Numerical results show 
comparable efficiency of the proposed methods to existing 
optimization methods. Future work will study the possibility 
of finding an exhaustive optimization method for the master 
problem with the linearization of the Lagrangian which could 
aid in constructing a global optimization method for finding 
the maximum sum rate operating point. 
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Appendix 

A. Proof of Lemma [7] 

In item (i), we have to show that f?fc, (X) — Y^) = 

log 2 |Z^ 1 1 — log 2 |Yfcj| > 0. Note that the function log 2 |A| 
with A >- 0 can be equivalently written as tr ( 0 (A)) where 

is a matrix-monotone function Definition 3.1] using 
similar derivation steps as lH9l Example 3.2]. We then have 
log 2 |Y fci | < log 2 | Zfc/1 for 0 x Y ki A Z^ 1 following the 
definition of matrix-monotone functions in lfl9l Definition 3.2]. 
Since a matrix-monotone function </>(A) is affected only by the 
eigenvalues of the matrix A | [l9l Definition 3.1], the equality 
in item (i) is achieved when the eigenvalues of Yand Xf- 
are the same. The proof of item (ii) follows from QS Lemma 


3.10] which states that the trace of a matrix monotone function 
is concave and monotone. 


B. Proof of Proposition [7] 

■ ki^ki 

tr (A 2 + B 2 ) tr (A 2 


Define A := (Y^)" 1 


— rt-Zfei and B := Y fci -Yj7. Then 


tr (AB) = tr (AB) - 
1 

~2 


W 


> - 


tr((A-B) 2 )+itr(A 2 + B 2 ), 


= -^l|A-B|| 2 . 


(19) 

( 20 ) 


( 21 ) 
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